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Abstract 


The  emphasis  is  on  development  of  likelihood  ratios  and  detection 
algorithms  for  problems  involving  nonGaussian  data.  The  first  problem 
considered  is  that  of  detecting  a  nonGaussian  signal  in  Gaussian 
noise.  This  frequently  arises  in  active  sonar;  it  could  also  be 
important  for  passive  sonar.  General  results  are  presented  on  non¬ 
singular  detection  and  likelihood  ratio.  A  recursive  discrete-time 
detection  algorithm  is  obtained  and  is  shown  to  be  a  likelihood  ratio 
detector  when  the  signal-plus-noise  is  Gaussian. 

The  second  major  problem  considered  is  tha t  of  detecting  a  signal 
in  spherical ly-invariant  noise  (SIN).  This  is  a  model  which  has  been 
proposed  for  some  i  it  pu  1  s  i  ve-p  1  us-Gau  s  s  i  an  environments,  and  is  closely 
linked  to  detection  problems  encountered  in  some  active  sonar  appli¬ 
cations.  General  results  on  nonsingular  detection  and  likelihood  ratio 
are  first  obtained.  For  detection  of  a  known  signal,  the  behavior  of 
the  d i sc r e t e- t i me  likelihood  ratio  is  analyzed  as  the  sample  size 
increases.  Const ant-fal se-a larm-pr obab ility  detectors  are  given,  and 
an  example  based  on  sonar  data  illustrates  the  potential  loss  due  to 
using  a  Gaussian  model  when  the  noise  is  actually  nonGaussian  SIN. 


LIKELIHOOD  RATIOS  AND  SIGNAL  DETECTION 
FOR  NONGAUSSIAN  PROCESSES 
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1.  INTRODUCTION 

NonGaussian  signal  detection  problems  arise  in  several 
applications  of  underwater  acoustics.  NonGaussian  signal  processes 
occur  for  active  sonar  when  the  reflecting  target  (with  surface 
undergoing  random  motions)  has  only  a  few  dominating  scatterers.  The 
noise  in  such  applications  is  frequently  Gaussian,  so  that  the 
detection  problem  is  that  of  detecting  a  nonGaussian  signal  embedded 
in  additive  Gaussian  noise. 

Problems  of  detecting  a  signal  in  nonGaussian  noise  also  arise; 
for  example,  for  sonars  operating  under  ice.  Noise  due  to 
ice-cracking,  creaking,  floe-smashing,  etc.,  contributes  a  component 
which  has  been  found  to  have  substantial  nonGaussian  behavior  [6],  In 
addition,  active  sonars  operating  under  ice  near  the  surface  may 
encounter  a  nonGaussian  component  due  to  specular  reflection  from  the 
irregular  under-ice  surface.  Another  environmental  situation  which  may 
produce  nonGaussian  noise  is  shallow-water  reverberation. 

Optimum  detection  algorithms  require  knowledge  of  the  statistical 
properties  of  data  processes.  For  applications  involving  nonGaussian 
noise  with  a  strong  impulsive  component,  a  useful  univariate  noise 
model  has  been  developed  by  Middleton  [12].  Much  remains  to  be  done 
in  this  area.  Development  of  optimum  detection  algorithms  requires 
knowledge  of  multivariate  statistical  models  for  both  the  noise  and 
the  s i gna 1 -p 1  us -no i se  processes.  At  present,  such  models  do  not  exist 
for  some  of  the  most  important  nonGaussian  environments.  Their 
development  will  require  a  mix  of  physics,  mathematics,  statistics, 
and  extensive  computational  investigations.  These  are  challenging 
problems  whose  solution  must  be  obtained  before  one  can  obtain  optimum 
detection  algorithms. 

This  contribution  first  considers  algorithms  for  the  detection  of 
nonGaussian  signals  in  Gaussian  noise.  Results  are  summarized  for  the 
continuous-time  problem;  more  attention  is  given  to  discrete-time 
approximations.  A  discrete-tim  ;  recursive  algorithm  is  given.  It  is 
shown  that  (under  appropriate  assumptions)  this  d i sc  re t e- t i me  algor¬ 
ithm  is  a  likelihood  ratio  detector  if  the  signal-plus-noise  process 
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is  Gaussian.  Attention  is  then  turned  to  signal  detection  for  problems 
involving  nonGaussian  spherical ly-invariant  noise  (SIN).  The  uni¬ 
variate  Class  A  model  of  Middleton  [12]  is  seen  to  be  a  special  case 

of  SIN.  The  likelihood  ratio  for  detection  of  a  signal  in  SIN  is 

derived  for  both  con t i nuous- t i me  and  d i sc r e t e- t i me  applications. 
Approximations,  including  constant  false-alarm  probability  (CFAP) 
detectors,  are  discussed.  The  effect  of  sample  size  is  also 
considered.  These  results  indicate  tl  at  robust  detection  can  be 
achieved  for  detection  of  known  signals  in  spher i ca 1 ly- i nvar ian t 

noise.  In  applying  these  results  to  Middleton's  Class  A  model,  it  is 
shown  that  placing  that  model  in  the  context  of  SIN  provides  a  number 
of  useful  consequences. 

All  stochastic  processes  to  be  discussed  are  real-valued  and 
defined  on  a  probability  space  (Q.  0,  P).  For  processes  in  continuous 

time,  the  parameter  set  is  [0 , T] ,  and  all  such  processes  are  assumed 
to  be  mean-square  continuous.  All  noise  processes  are  assumed  to  have 
zero  mean.  ( V ( t ) )  will  denote  a  stochastic  process,  while  V(t)  will 
denote  the  random  variable  obtained  by  sampling  the  process  at  the 
time  t.  The  argument  cj  in  ft  will  typically  be  suppressed: 
V(t)  =  V(t,<j),  ^[O.T]  is  the  linear  space  of  all  Lebesgue-square- 

integrable  functions  on  [0.T],  L2[0,T]  is  the  set  of  equivalence 

classes  [u]  obtained  from  functions  u  in  ^[O.T].  For  a  noise  process 
(N(t)),  r^  will  denote  the  covariance  function:  r^(t,s)  =  E  N(t)N(s). 

will  denote  the  covariance  operator  of  ( N { t )  )  :  that  is,  the 
integral  operator  in  L2[0,T]  having  r^  as  its  kernel.  will  be 

assumed  strictly  positive;  (A^)  is  the  sequence  of  (strictly  positive) 
eigenvalues  of  R^ ,  with  (en)  corresponding  c.o.n.  eigenvectors. 
T 

<u,v>  =  P  u(t)v(t)dt  for  u  and  v  in  L_[0,T].  The  s i gna 1 -p 1 u s -no i se 

J  0 

process  at  time  t  will  be  Y(t)  =  S(t)  +  N(t).  In  the  con t i nuous- t i me 
case,  the  likelihood  ratio  sought  is  on  L^CO.T]:  dp^/dp^ ,  where  p^ 
(resp.  ,  p^)  is  the  probability  on  LgCO.T]  induced  by  the  stochastic 
process  (Y(t))  (resp.,  (N(t))). 

For  a  noise  (N(t))  with  covariance  function  r^,  will  denote 

the  reproducing  kernel  Hilbert  space  of  r^ .  As  is  well-known,  there 

j_ 

is  an  isometry  between  and  range(R^):  the  element  u  is  in  if  and 

only  if  there  exists  a  unique  element  [u]  in  range(R^)  generated  by  u. 

The  inner  product  of  two  elements  u  and  v  in  is  given  by 

[ u . v ] M  =  I  <[u],e  ><[v],e  >/X  .  Since  rM  is  taken  to  be  continuous. 
N  .  u  n  n  n  N 


the  elements  of  will  be  continuous  functions.  It  can  also  be  noted 

that  range(R^)  is  a  real  separable  Hilbert  space  under  the  above  inner 
product  (i.e.,  with  respect  to  the  inner  product  ([u].[v])^  =  [u,v]^). 
For  simplicity,  the  element  [u]  in  LgfO.T]  will  usually  be  written 
simply  as  u. 

For  observations  x  in  n-d i men s i ona 1  Euclidean  space  En ,  the  noise 
covariance  matrix  will  be  assumed  strictly  positive.  A*  is  the 
transpose  of  the  matrix  A.  For  a  bounded  linear  operator  A  in  L^fO.T], 
Aw  will  denote  the  adjoint. 

In  discussing  existence  of  likelihood  ratios  for  cont inuous- t ime 
processes,  it  is  necessary  to  introduce  more  mathematical  structure. 
Thus,  let  (X(t)},  t  in  [0,T]  be  a  m.s.  continuous  stochastic  process. 

a{Xs>  s<t}  is  the  a-field  generated  by  {Xs,  s<t};  ct^(X)  is  the  filtra¬ 
tion  consisting  of  all  the  o-fields  ct  { X^  ,  s  <  t  '  }  for  0  <  t’  <  t.  o^ ( X ) 
will  denote  a~(X)  .  at(X)  (resp.  ,  a(X))  will  denote  the  filtration 

generated  by  o^(X)  (resp. ,  ct^(X))  and  all  sets  of  P-measure  zero,  com¬ 
pleted  with  respect  to  the  underlying  probability  P.  If  (X(t))  and 
( V( t  )  )  are  two  such  processes,  then  2t(V)  v  ot(X)  will  denote  the 
smallest  filtration  containing  both  ot(V)  and  ct  (X)  ,  with  a(V)  v  a(X) 
similarly  defined. 

The  con t i nuou s - t i me  problems  considered  here  will  be  modeled  in 
1. 2  [  0 ,  T  J  .  Somewhat  similar  results  can  be  obtained  by  considering  the 

ro  ti 

probabilities  induced  on  RL  ’  ,  the  real-valued  functions  on  [0,T]. 
However,  those  results  [5]  are  not  so  complete  as  those  for  L^fO.T]. 

For  fiy  and  p^  probabilities  on  the  Borel  sets  of  LgfO.T], 
liy  <<  p^  denotes  absolute  continuity  of  Py  with  respect  to  p^  (so  that 
the  likelihood  ratio  dpy/dp^  exists).  Py  1  p  denotes  orthogonal 
probability  measures;  in  detection  applications,  orthogonal  measures 
imply  singular  (perfect)  detection.  Py  p^  denotes  mutual  absolute 
continuity:  py  <<  p  and  p^  <<  Py  .  P^  will  denote  probability  of 
detection  =  probability  of  correctly  deciding  signal  present.  P 

r  A 

denotes  probability  of  false  alarm  =  probability  of  incorrectly 
deciding  signal  present.  Pp^  =  0  implies  P^  -  0  if  p^  +  ^  <<  p^  ;  P^  =  1 
implies  Pp^  =  1  if  p^  <<  p^  +  ^.  Thus,  p_,  +  ^  ^  p^  is  the  situation 
usually  assumed  to  hold  for  practical  problems  in  signal  detection.  We 
refer  to  this  as  non -singular  detection. 

Treatment  of  the  continuous-time  case  introduces  substantial 
complication  into  the  analysis.  However,  it  clarifies  structure. 
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enables  one  to  obtain  discrete-time  finite-sample  algorithms  as 
approximations,  and  provides  valuable  performance  bounds. 

2.  DETECTION  OF  NONGAUSSIAN  SIGNALS  IN  GAUSSIAN  NOISE 

An  important  active  sonar  application  is  the  detection  of 
nonGaussian  signals  embedded  in  additive  Gaussian  noise.  For  example, 
when  the  noise  background  is  reverberation-1 imi ted ,  the  scatterers 
giving  rise  to  the  reverberation  can  frequently  be  assumed  to  be 
s t a t i s t i ca 1 1 y- i ndependen t  in  their  reflecting  properties.  Application 
of  the  central  limit  theorem  then  gives  a  Gaussian  process  for  the 
reverberation  process.  However,  if  the  target  return  is  primarily  due 
to  reflections  from  a  few  random  scatterers  (each  contributing  random 
phase  and  amplitude),  then  the  composite  reflection  from  the  target 
will  generally  be  nonGaussian.  In  this  particular  application,  the 
signal  and  noise  processes  are  dependent,  and  the  noise  process  is 
nons  ta  t ionary . 

Other  applications  may  also  involve  detection  of  nonGaussian 
signals  in  Gaussian  noise.  For  example,  in  passive  sonar  the 
background  noise  can  frequently  be  assumed  to  be  Gaussian  and 
stationary.  However,  signal  sources  such  as  sh i p-rad i a t ed  noise  need 
not  be  Gaussian. 

Full  solution  of  such  problems  ideally  includes  determining  con¬ 
ditions  for  nonsingular  detection,  and  then  (when  nonsingular 
detection  holds)  determining  the  likelihood  ratio. 

If  the  s igna 1 -p 1 u s-no i s e  process  is  also  Gaussian,  then 
conditions  for  nonsingular  detection  and  the  form  of  the  likelihood 
ratio  are  well  known  [4,  14].  If  the  signal  is  nonGaussian  and 
independent  of  the  noise,  then  sufficient  conditions  for  nonsingular 
detection  are  given  in  [3],  With  the  noise  Gaussian,  the  sufficient 
condition  is  that  the  sample  paths  of  the  signal  process  belong  (w.p. 
1)  to  H^  ,  the  reproducing  kernel  Hilbert  space  of  the  noise.  Under 
mild  assumptions,  an  expression  for  the  likelihood  ratio  can  also  be 
obtained  from  the  results  of  [3]. 

If  nothing  is  known  about  the  signal-plus-noise  process  except 
its  covariance  and  mean  functions,  then  of  course  a  likelihood  ratio 
detector  cannot  be  determined.  However,  if  one  limits  consideration  to 
quadratic-linear  operations  on  the  data  (in  forming  a  test  statistic), 
then  the  deflection  criterion  can  be  used  to  determine  the  optimum 
operation.  That  is,  let  T  be  the  class  of  all  admissible  test 


statistics  t.  The  deflection  of  t  is  then  Dq^(t) 

=  (Ent(x)  -  es+nt(x) )2/(Ent2(x)  -  [Ent(x)]2),  where  EN(-)  (resp.. 

Es+n(*))  denotes  expectation  w.r.t.  the  noise  (resp.,  signal-plus- 
noise).  The  problem  then  consists  of  determining  sup^.  Dq^(t),  and 
determining  a  t  achieving  this  supremum  or  a  sequence  (t^)  converging 
to  the  supremum.  The  optimum  quadratic  operation  for  d i sc r e t e- t i me 
f i n i t e- samp  1 e  data  is  given  in  [1],  while  [2]  contains  the  solution 
for  the  infinite-dimensional  case  and  results  linking  deflection  to 
nonsingular  detection. 

The  results  on  deflection  given  in  [1]  and  [2]  apply  to  problems 
where  the  s i gna  1  -p 1 u s -no i s e  process  is  neither  Gaussian  nor  consisting 
of  the.  noise  plus  an  independent  signal  process.  For  such  problems,  it 
is  desirable  to  have  general  conditions  for  nonsingular  detection  and 
also  expressions  for  the  likelihood  ratio.  Current ly-avai lable  data 
models  may  be  inadequate  to  fully  utilize  such  results,  but  their 
availability  for  future  use  is  clearly  desirable.  Results  for  the 
special  con t i nuous- t ime  case  when  the  noise  is  the  Wiener  process  have 
been  known  for  many  years  [11]-  However,  the  Wiener  process  has 
properties  that  are  not  observed  in  practical  sonar  problems:  sample 
functions  that  are  almost  surely  nond i f f erent iab le  at  almost  all  time 
points,  the  Markov  property,  and  the  martingale  property.  Thus,  the 
design  of  future  optimum  signal  detection  systems  requires  results 
beyond  those  already  mentioned;  such  results  have  recently  been 
obtained  [5]. 

The  results  contained  in  [5]  include  general  conditions  for 
nonsingular  detection  of  a  possibly  nonGaussian  signal  imbedded  in 
additive  Gaussian  noise.  The  work  is  based  on  the  spectral 
representation  of  second-order  stochastic  processes,  particularly  as 
developed  by  Hida  [10].  The  general  problem  is  that  of  discriminating 
between  a  Gaussian  noise  process  (N(t)),  t  in  [0,T],  and  a  possibly 
nonGaussian  process  ( Y ( t  )  )  .  t  in  [0,T]. 

The  basic  assumptions  made  in  [5]  are  the  following: 

(A. 2-1)  (N(t))  vanishes  almost  surely  at  t  =  0; 

(A. 2-2)  (N(t))  has  a  pu r e 1 y-con t i nuou s  spectral  representation  of 

multiplicity  M  <  « . 

Assumption  (A. 2-2)  is  equivalent  to  ( N ( t  )  )  having  a  repre¬ 
sentation  of  the  form 

M 

N ( t )  -  2  ftF  (t.s)dB  (s)  (2. 1) 

i  =  1  J0 

where  {(B  (t)):  i  <  M  ,  t  in  [ 0 , T ] }  is  a  family  of  independent-increment 
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mu t ua 1 1 y- independen t  path-continuous  zero-mean  Gaussian  processes,  and 
each  is  a  Bo r e 1 -measurab 1 e  function  on  [0,T]x[0.T]  with  F.(t,s)  =  0 
for  s>t.  This  representation  also  satisfies 

M  -p  n 

iff  FZ  (  t  ,  s  )  dj3  .  (  s  )d  t  <  00  ,  where  [5.  is  the  Borel  measure  on  [O.T] 
i  =  l  J0  J0  1  1  1 

defined  by  the  non-decreasing  variance  of  { B  ( t ) ) : 

P.(a,b]  =  EBj(b)  -  EB^(a) . 

The  representation  (2-1)  is  taken  to  be  the  proper  canonical 
representation  for  { N ( t ) )  [10].  One  consequence  is  that  the  completion 
of  the  a-field  a{B^(s):  i<M,  s<t)  is  the  same  as  the  completion  of 
cr{N(s):  s<t}  for  each  t  in  [O.T],  In  general,  the  equality  (2-1)  holds 
almost  surely  dP  for  each  fixed  t  in  [0,T];  by  assuming  that  (N(t))  is 
separable  w.r.t.  closed  sets,  one  obtains  a.s.  path  equality. 

The  basic  results  on  non-singular  detection  of  a  possibly 
nonGaussian  signal  embedded  in  additive  Gaussian  noise,  as  given  in 
[5],  entail  both  a  set  of  sufficient  conditions  [5,  Theorem  2]  and  a 
set  of  necessary  conditions  [5,  Theorem  3].  The  sufficient  conditions 
for  absolute  continuity  on  LgtO.T]  are  given  in  the  following  result. 


Proo.  2.1  f  5  .  Theorem  3]  Let  { V  (  t )  )  be  a  stochastic  process  indepen¬ 
dent  of  (N(t)).  Suppose  that  (S(t))  is  a  stochastic  process  adapted  to 
a(N)  v  a(V)  and  with  paths  a.s.  in  .  If  Y(t)  =  S(t)  +  N(t)  a.e. 
dtdP,  then  p,y  <<  p^  . 

Both  the  sufficient  conditions  and  the  necessary  conditions 
include  the  requirement  that  the  signal  process  have  a  representation 
with  almost  all  paths  in  the  reproducing  kernel  Hilbert  space  of  the 
noise  covariance  function  r^.  With  the  representation  (2-1),  this 
means  that  almost  all  sample  functions  of  the  signal  process  have  a 
representation  of  the  form 


S(t) 


(2-2) 


where  { Q  (  t  ) )  is  a  stochastic  process  with  almost  all  paths  in  2  [  P  j  ]  : 
T  2 

f  Q  (s)dp.(s)  <  m  a.s.  dP.  The  remaining  conditions  for  absolute 

J  0  1  1 

continuity  embody  measurability  conditions  on  the  signal  process. 
These  conditions  are  given  in  terms  of  the  noise  process  (N(t))  and  a 


stochastic  process  ( V { t )  )  independent  of  the  noise.  They  are 


essentially  related  to  the  signal  process  being  a  causal  functional  of 


the  two  processes  ( N  (  t ) )  and  ( V  (  t ) )  .  The  basic  idea  is  that  the  signal 


may  be  a  causal  functional  of  both  the  noise  process  (as  in  the  case 
of  dependent  signal  and  noise)  and  an  independent  "message"  process. 

The  likelihood  ratio  dpg+^/dp^  for  this  problem  is  given  in  [5. 
Theorem  7],  Define  a  vector  stochastic  process  (Z(t))  by 


zi ( t )  =  JC  Q.(s)dp.(s)  +  B.(t) 


(2-3) 


where  the  processes  (Q.(t))  are  those  appearing  in  (2-2)  above.  Then 


M  t 

t)  +  N(t)  =  2  r  F  (t.s)dZ  (s) 

i  =  1  J0 


(2-4) 


The  vector  processes  (Z(t))  and  (B(t))  define  probabilities  Pg  and 

on  the  space  of  all  M-component  vector  functions  on  [0,T]  whose 

component  functions  are  all  continuous.  Under  the  conditions  for 

existence  of  du„  ,,/du.. ,  dP^/dP,,  will  also  exist,  and  for  an 

S+N  N  ZB 

observation  x  in  L„[0,T], 


[dMS+N/dMN^ =  [dPz/dPB](m[x]) 


(2-5) 


■i.e.  dpN(x).  m(x)  is  an  M-component  vector  of  continuous  functions, 


defined  by 


m.[x](t)  =  2  <x,e  ><f  ,e  >/X  , 

l L  J  v  .  .  n  t  n  n 

n  >  1 


(2-6) 


f j(s)  =  J*  F.(s,u)dp.(u). 


The  likelihood  ratio  dP^/dP^  of  (2-5)  has  some  explicit 


known 


representations  [11],  depending  on  the  properties  of  ( Z ( t  )  )  .  These 

representations  are  based  on  the  fact  that  each  (B  (t))  is  a  path- 
continuous  Gaussian  martingale. 

The  results  given  above  are  for  continuous-time  observations.  In 
sonar  applications,  it  is  desirable  to  have  discrete-time  recursive 
algorithms,  which  do  not  require  complete  recomputation  of  the  test 
statistic  each  time  a  new  data  point  is  received.  Moreover,  it  is 
desirable  to  have  an  algorithm  with  parameters  that  can  be  estimated 
from  data,  since  a  complete  data  model  will  not  usually  be  available. 
Such  an  .algorithm  will  now  be  derived.  It  will  be  based  on  the 
following  additional  assumptions: 


(A. 2-3)  The  noise  process  has  multiplicity  M=l,  and  the  process 

(Bj(t))  is  the  standard  Wiener  process  (W(t));  thus 

N(t)  =  ft  F(t,s)dW(s),  where  F  is  a  Volterra  kernel  with 

J  0 

fT  fT  F2(t,s)dsdt  <  ®; 

J0  J0 

(A. 2-4)  The  process  (Z(t))  defined  in  (2-3)  is  a  diffusion  with 
respect  to  the  Wiener  process  and  has  memoryless  drif  t 

function,  so  that  Z(t)  =  f'’  or[Z(s)]ds  +  W(t).  (2-7) 

J0 

The  assumption  (A. 2-3)  is  reasonable  from  several  viewpoints, 

such  as  the  fact  that  mu  1  t  i  p  1  i  c  i  t  y  -  one  processes  are  dense  (by  a 

mean-square  distance  criterion)  in  the  space  of  all  second-order 
processes,  and  that  any  Gaussian  vector  can  be  represented  as  the 
result  of  a  lower-triangular  matrix  operating  on  white  Gaussian  noise. 
One  can  also  show  that  the  assumption  (A. 2-3)  is  satisfied  whenever 
the  noise  process  has  a  proper  canonical  representation 

NT  (  t )  =  fC  F(t,s)dB(s),  where  the  variance  of  (B(t))  is  an  absolutely 

J0 

continuous  function  on  [0,T]. 

The  assumption  (A. 2-4)  is  less  tenable;  it  is  made  primarily  for 
computational  convenience  (which  is  in  fact  not  very  convenient,  even 
so)  when  the  s  i  gna  1 -p 1 us-no i se  statistics  are  unknown.  It  does  permit 
one  to  consider  a  very  large  class  of  s i gna 1 -p 1 u s-no i se  processes 
without  having  complete  knowledge  of  the  statistics.  Of  course,  if  a 
complete  mathematical  model  is  available,  the  assumptions  (A. 2-3)  and 
(A. 2-4)  need  not  be  made  (if  dP_/dP„  can  be  determined). 

L  D 

For  the  detection  problem  as  defined  above,  the  general  form 
(under  a  mild  restriction)  of  the  likelihood  ratio  is 

[d'is  +  fv/d'JN]  (x)  =  lim  exP  LAV)] 

n 

where  0  =  t  ™  <  t  *?  <  t  ^  <  .  .  .  <  t. n  =  T  is  a  partition  of  [0,1]  such 
0  12  n  L  J 

that  sup  I  t n  .  -  t n  I  -*  0 , 

j  '  .1  +  1  .3  ' 

n  -  1 

An(x)  =  2  a  (  m  [  x  ]  (  t  ?  )  )  (m[x](  t  *?  .  )  -  mOJfr'1)) 

i  =0 

(2"S) 

n  -  1  ,. 

-  (1/2)  >1  a  (m[x](  i  V )  )  (  t"  -  «"). 

i  -0 

a  nd  the  limit  exists  in  the  norm  of  I  [ p . ,  ]  . 


The  representation  of  (N(t))  by  N  =  fC  F(t.s)dW(s)  yields  that 

J0 

R^  =  FF*.  where  F  is  the  integral  operator  with  F(t.s)  as  its  kernel, 
and  F*  is  its  adjoint.  This  can  be  used  to  provide  an  expression  for 
the  function  m  appearing  in  (2-5)  and  (2-6)  that  does  not  require 
calculation  of  eigenvalues  and  eigenvectors. 

T  t 

First,  notice  that  <e  .  ,  f  >  =  P  f  F(s,u)du  e.(s)ds 

J  t  J0  Jq  j 

_  Pc  p^suje  .(s)dsdu  =  [LF*e.](t),  where  [  L  f  ]  (  t )  =  P1"  f(v)dv.  Using 

J0  Jq  J  J  J0 

this,  the  expression  (2-6)  for  m  can  be  rewritten  as 

k  -1 

m[x](t)  =  [LF*  2  <e.,x>  Rn  e.](t) 

k-*00  1  J  W  J 

=  lim  [LF^R^P  x](t) 
k-*00 

where  P^x  is  the  projection  of  the  function  x  on  the  subspace  spanned 

by  (e^ . ek)  -  Since  R^  =  F*  *F  *,  the  preceding  becomes  m[x](t) 

=  lim  [LF_1P  x](t). 
k-*» 

A  basic  difficulty  is  that  (with  probability  one  [3])  the 

observation  x  will  not  be  in  the  domain  of  the  operator  F  so  that 

F  *x  is  not  defined.  In  fact,  LF  *  will  in  general  not  be  a  bounded 
linear  operator.  However,  for  almost  all  sample  functions  x  (either 
from  noise  or  signal-plus-noise)  ,  m[x](")  is  a  continuous  function  of 
t.  Thus  the  map  m  is  a  linear  operator  from  LgfO.T]  into  C[0,T]  whose 
domain  includes  (with  probability  one)  all  sample  functions  of  the 
noise  and  s i gna 1 -p 1 us-no i s e  processes. 

The  difficulty  in  implementation  of  the  likelihood  ratio  (2-8) 
will  lie  in  determining  the  function  a  and  linear  operator  m.  o  is  a 
parameter  of  the  s i gna 1 -p 1 u s -no i s e  process,  and  its  estimation  is  a 
problem  of  considerable  interest  in  stochastic  processes  (as  the  drift 
of  a  diffusion)  and  in  stochastic  Filtering.  The  possibly  unbounded 
linear  operator  m,  mapping  LgCO.T]  into  C[0,T],  depends  only  on  the 
covariance  function  of  the  noise.  If  the  noise  covariance  function  is 
known,  then  the  preceding  expressions  can  be  used  to  obtain  a 
d i sc r e t e- t i me  finite-sample  approximation  to  the  likelihood  ratio. 
Here  we  consider  such  approximations  when  one  knows  only  the 
covariance  matrix  of  the  noise. 

Let  R„  denote  the  covariance  matrix  of  the  noise;  one  can  write 
RN  =  F  F * .  where  the  matrix  F  is  lower  triangular.  Now,  the  expression 
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for  m  given  above  is  of  the  form 

m[x](t)  =  lim  [LF_1P  x](t), 

k-*co 

where  =  FF*.  L  is  the  integration  operator,  and  is  the 

projection  of  x  onto  the  subspace  spanned  by  (e^ . e^}.  w^>ere 

{e  .  n>l)  are  o.n.  eigenvectors  of  R.r .  Thus,  a  reasonable  procedure  is 
n  N 

simply  to  replace  this  expression  by  m[x]  =  L  F  *x,  where  x  is  the 
observed  data  vector,  and  L  is  the  summation  operator  in  E  ; 

j 

( L  x )  .  =  2  x  .  . 

J  i  =  l  1 

There  is  a  fundamental  difference  between  the  above  approximation 

to  m  and  the  exact  result.  As  previously  mentioned,  F  ^x  is  (with 

probability  one)  not  defined  for  the  continuous-time  situation;  here, 

of  course,  there  is  no  such  problem  for  F  ^x. 

Implementation  of  the  di sere te- t ime  algorithm  for  a  fixed  sam¬ 
pling  interval.  A,  will  now  be  considered.  Then,  when  the  observation 
is  an  n-component  vector,  and  the  above  approximation  is  used,  one 
obtains  as  an  approximation  to  the  log-likelihood  ratio  the  expression 

An(x)  =  V  (o[(L  F_1x)  ])[(L  E_1x)  -  (L  F-1x)  ] 

j=0  J  J  1  3 

n~  *  2  - 1 

-  (A/2)  2  a  [(L  F  x).]  (2-9) 

j=0  J 

=  V  (a[(L  F-1x)  .])[(F'1x)  ]  -  (A/2)  V  a2  [(L  F_1x)  ]. 

j=0  J  J  1  j  =0  3 

If  now  a  new  data  point  xn+^  is  observed,  the  approximation  has  the 
recursive  form 

An+1(x)  =  An ( x )  +  a[(L  F"1x)n](F"1x)n+1  -  (A/2)  a2[ ( L  F~ 1 x ) n ] .  (2-10) 

One  notes  the  following-’ 

(1)  Implementation  and  calculation  of  A  require  the  following 
operations.  First,  the  function  a  must  be  known  and  pro¬ 
grammed.  Given  the  value  of  An(xn)  and  the  observation 

xn  =  (Xj . x^),  °ne  stores  An(xD),  xn.  a  [  ( L  F  ^x)  ],  and 

(L  F  ^ x n )  When  the  data  point  x  is  received,  it  is  only 


which  means 


necessary  to  use  xn+ ^  to  calculate  (F  ^xR+^)n+^. 
to  cross-correlate  the  observation  xn+ *  with  the  n+ 1  row  of 
F  This  number,  say  b^+  ^  ,  is  then  used  to  form  An+^(x_n+*), 

An+1(xn+1)  =  An(xn)  +  a[2  b.]bn+1  -  (A/2)  a2  [2  b.]. 

This  is  much  simpler  than  a  procedure  whereby  the  function 

m  =  L  F  ^  is  expressed  in  terms  of  its  eigenvalues  and 
eigenfunctions,  since  those  quantities  would  have  to  be 

stored  for  En  and  all  the  sample  indices  n  >  1,  and  a  com¬ 
plete  new  calculation  done  for  each  new  sample  point 
observed . 


(2) 


(3) 


As  already  noted.  the  expression  An(xn)  can  only  be 
considered  as  an  approximation  to  the  d  i  sc r e t e- 1 ime 
likelihood  ratio.  This  approximation  becomes  more  valid  as  n 
increases,  since  it  amounts  to  representing  the  noise  vector 


i  - 1 

N  by  N  .  =  2  F  .  .  ( 4W.)  •  .  where  A)V  is  a  vector  of  i.i.d. 

1  j  =  l  1J  J 

i  - 1 

N  (  0  ,  A  )  random  variables.  As  n  increases,  2  F(iA,  jA)(AW). 


j  =  l 


J 


will  converge  in  mean-square  to  N(iA)  =  N(t),  keeping 
iA  =  t,  where  the  function  F  is  that  appearing  in  the 


representation  N  =  P*"  F(t,s)dW  .  Thus,  as  n  increases,  the 

1  J0  S 
i  - 1 

representation  of  N  by  N  (  t )  =  2  F.  .(A W.)  .  converges  to  the 

j=l  1 J  J 

representation  satisfying  the  original  con t i nuou s- t i me 

models  for  noise  and  s i gna 1 -p 1 u s-no i se . 

ct  can  be  estimated  from  a  sample  of  data  representative  of 
s i gna 1 -p 1 us-no i se  .  In  discrete  time,  the  procedure  is  as 
follows,  given  an  observed  S+N  vector  X. 


a)  Form  AZ^  =  F  X,  where  =  F  Fj * ,  F  1  owe  r- t  r  i  angu  1  ar  , 
(AZ).  =  Z ( i A )  -  Z( [ i -  1 ] A ) .  ZQ  =  0. 

i  -  1 

b )  Z (  i  A  )  =  A  2  a(ZjA)+W(iA). 


Given  the  sample  vector  Z  obtained  from  b),  the  function  a  can  be 
estimated.  A  max i mum- 1 i ke 1 i hood  procedure  is  given  in  [8], 

Of  course,  the  approximations  (2-8)  and  (2-9)  need  not  be 
likelihood  ratios  for  a  fixed  finite  set  of  sample  points.  However,  it 


will  now  be  shown  that  (2-9)  is  a  likelihood  ratio  when  the  function  a 
is  linear.  In  this  case,  S  +  N  is  Gaussian,  so  that  the  likelihood  ratio 
du„  ../du.T  can  be  found. 

In  accord  with  the  model  for  { Z ( t ) ) .  the  d i sc  re t e - t i me  represen¬ 
tation  is  (for  a  linear) 


Z,  ,  =  A  2  a  .Z  .  +  W,  ,  . 
k+1  .  j  J  J  k+1 


It  will  be  shown  that  for  an  observation  vector  x  in  E  , 

-1  -1  n~*22  n-^ 

-  x*(E_z  -  E.w  jx/2  =  -  (A/2)  2  a.x.  +  2  a.x.(xi  +  1-x.) 

i  =  1  i  =  1 


(2-11) 


(2-12) 


The  LHS  of  (2-12)  is  the  1  og- 1 i ke 1 i hood  ratio  (within  a  constant)  of 
dP^/dP^.  Given  the  equality  (2-12),  if  one  has  that  N  =  F  AW . 

S  +  N  =  F  AZ_,  then  [dp^/d^]  (x)  =  [dP^/dP^]  (  F~  1  x)  . 


Thus  let 


Z.  =A  2  a.Z.+W.,, 

k+1  J=1  J  J  k+1 


Z !  =  V 


k>  1  , 


Let  A  be  the  matrix  d  i  ag  [a  ^  ,  .  .  .  ,  a^  ]  .  The  RHS  of  (2-9),  evaluated  at 


y_  =  F  L  x.  then  becomes 


2  (Ax)  (x  -x  )  -  f  A  2  ( Ax ) ^  . 

j=l  J  J  J  j=l 


(2-13) 


To  show  that  (2-9)  is  a  likelihood  ratio  test  statistic,  it  will  first 

be  shown  that  (2-13)  is  equal  to  -  x*(R  Z_R^  )x/2  =  log  [dP^/dPw](x) 

+  cons  tan  t  . 

The  above  representation  for  Z  gives 
(I  +  AA)Z  =  AL  A  Z  +  W 

so  Z  =  B_1W 

B  =  I  +  AA  -  AL  A. 

Z  thus  has  covariance  matrix  R_  =  B  ^  ,  so  R  ^  =  B*R  ^B.  Since 

z  w  z  w 

R  w  (  i  .  j  )  =  Amin(i.j),  R  ^  =  ALL**,  and  thus 

R~  1  =  (I  +  AA  -  AA  L* )  L*~  1 L _  1  ( X  +  AA  -  AL  A)/A 

where  (L  *  )  =  1  if  i  =  j 

=  -  1  if  i=.j  + 1 
=  0  otherwise. 


rh 


is  gives  -  R  ^ 


-  [AAL*  lL  1 A  +  AL*  lL  1  +  L*  *L  *A  -  AL  1 (I+AA)  -  (I+AA)L*  *A  +  AA2] 


and  for  a  data  vector  x. 


x  *  (  R  z  1  -  Rw  1  )  x  =  A  (  L  1  Ax  )**L  1  Ax  +  2(L  lx)*L  1  Ax 


-  2  ( L  1x)«Ax  -  2  A  (  L  ^^Ax  +  A(Ax)*Ax 


n  1  22 

The  three  terms  containing  A  sum  to  A  2  aTx  .  ,  while  the  other  two 

i  =  l  1  1 


n  -  1 


terms  sum  to  -2  2  a.x.(x.  ,-x . ) .  so  that 

.  ,  l  l  v  l  +  l  l 

1  =  1 


1  .-1 


n-  1 


x*(R7  -Ru  )  x  =  —  2  2  a.x.(x.  -x.)  + 

-v-z  -w  '  -  .  j  JJVJ+1  y 


n  1  22 

A  2  a.x..  as  desired 

J  J 


j  =  l 


-  1 


This  shows  that  (2-9),  evaluated  at  y  =  F  L  x,  satisfies 


An(F  L  *x)  =  -  x*(R.z*  -  R^1)x/2  =  log  [dP^/dP^Jfx)  +  constant.  More¬ 


over.  [d*ig  +  N/dfiN]  (y)  =  [dPAZ/dPAW^-  ^  =  CdPz/dPw](L  L  1  ii)  •  the  last 


equality  because 


AZ 


L  1 Z  , 


AW  =  1.  *W. 


With 


.  -1 

iL  =  L  L  x. 


Cdhs  +  N/d/iN  ]  (y.)  =  [dPz/dPw](x)  =  (from  above)  exp[An(y)  +  constant] 


Thus,  when  the  above  assumptions  are  satisfied  (including  the 
assumption  that  Z  is  a  Gaussian  vector),  the  approximation  given  in 
(2-9)  is  a  discrete-time  finite-sample  likelihood  ratio. 


3.  SPHERICALLY- INVAR  I  ANT  NOISE  (SIN)  MODELS 


Let  (N(t)),  t  in  T,  be  a  real-valued  zero-mean  stochastic  process 
on  a  probability  space  ( 0 ,  (3 ,  P )  .  N  is  said  to  be  spherically  invariant 
if  it  has  the  representation  N(t)  =  AG(t)  for  each  t  in  T.  where  G  is 
a  Gaussian  process  and  A  is  a  random  variable  which  is  independent  of 
G  and  which  has  finite  second  moment.  Since  it  can  be  assumed  that 
2 

EA  =1.  the  covariance  of  N  can  be  taken  to  be  the  same  as  that  of  G. 
Thus,  the  finite-dimensional  distributions  of  N  are  completely 
determined  by  its  covariance  and  by  the  distribution  of  the  random 
variable  A.  SIN  can  thus  be  viewed  as  a  first  step  away  from  Gaussian 
noise  . 

If  the  random  variable  A  is  discrete,  then  the  distribution  of 
the  random  vector  (  N  {  t  .  ) . N  (  t  )  is  given  by  the  density  function 
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f(N)  =  2  p.n(O.a^R) 

i  =  1 


(3-1) 


j* 

where  n(a,B)  is  the  density  of  a  Gaussian  random  vector  (in  E  )  with 
mean  a  and  covariance  matrix  B.  In  the  representation  (3-1), 
P[A=a^]  =  p^,  and  K  <  00  is  the  number  of  distinct  values  that  A 
assumes  with  positive  probability.  In  this  paper,  it  will  be  assumed 
throughout  that  A  is  a  discrete  random  variable.  We  also  assume  (WLOG) 
2 

that  EA  =1  and  that  A  is  strictly  positive. 

The  model  for  univariate  impu 1 s i ve-p 1 u s-Gau s s i an  noise  developed 
by  Middleton  [12]  takes  two  basic  forms,  defined  as  Class  A  and  Class 
B,  depending  on  the  relative  bandwidth  of  noise  and  receiver.  The 
Class  A  model  is  defined  to  exist  when  the  impulsive  noise  pulses  do 
not  cause  transients  in  the  front  end  of  the  receiver;  it  is  thus  a 
model  for  narrowband  noise.  The  univariate  density  function  as 
developed  by  Middleton  has  the  form  [12] 


c ,  -u  „  u  _ 

f(x)  =  e  I  —  _  exp 

„  m !  a  ^ 

m  =  0  v2t r  m 


(3-2) 


where  U  is  the  "overlap  index"  and  (am)  is  a  sequence  of  variance 
components.  The  overlap  index  is  defined  to  be  the  average  number  of 
arrivals  per  second  multiplied  by  the  average  length  of  the  pulse.  The 

2  2-1 

variance  component  a  is  defined  by  a  (mil  +  T)/(l  +  f" )  ,  where  T 

m  m 

is  the  ratio  of  the  intensities  of  the  Gaussian  and  non-Gaussian 
components  of  the  noise. 

It  can  be  seen  that  (3-2)  is  the  probability  distribution  of  a 

spherical ly-invariant  random  variable  X  =  AY,  where  Y  is  a  zero-mean 

unit-variance  Gaussian  r.v.,  and  A  is  an  independent  r.v.  taking  the 

values  (a  )  with 
v  m  ‘ 


=  P [ A=a  ]  =  Um  e  U/m ! 
L  m J 


(3-3) 


m  i ? m  -U 
co  U  e 


In  f  ac  t  ,  E  A" 


(jy  +  r)/( i+r)  =  i . 


In  [16],  Spaulding  and  Middleton  analyse  the  problem  of  detecting 
a  known  signal  in  Class  A  noise  by  assuming  independent  sampling,  so 
that  the  sampled  noise  data  has  joint  density  function  (n  samples) 


p(x)  =  17 


i = 1  m=0 


(3-4) 


However,  if  the  r.v.  A  is  constant  over  the  sampling  interval,  then 
the  density  of  X  =  AY,  where  the  components  of  Y  are  i.i.d.  N(0,1).  is 


P(x)  = 


00 

2 

m  =  0 


1 


2t r  a 


2  i  n/2 


exp 


m 


II  x  II 
2a2 


2i 


(3-5) 


2 

where  llxll  =  2 


i  =  1 


2 

x  .  . 

l 


When  the  Gaussian  process  Y  has  non-singular 

covariance  matrix  R,  then  the  class  A  noise  has  joint  density  (if  the 
r.v.  A  is  cons  ant  over  the  observation  interval) 

1 


P(x)  = 


CO 


m  =  0 


2  tt  a 


m 


n/2 


(  de  t  R ) : 


exp 


-z"  x*R  ^x/a2 
—  m 


(3-6) 


As  will  be  shown  in  the  next  section,  for  reasonably  large  n  it 
is  not  necessary  to  know  the  values  of  U  and  T  in  order  to  implement 
this  detector.  This  fact,  as  well  as  the  joint  density  (3-6), 
illustrates  some  of  the  advantages  of  using  a  general  SIN  model 
whenever  appropriate. 

Of  course,  SIN  models  are  not  limited  to  the  Middleton  model. 
They  cover  a  large  family  of  smooth  unimodal  densities  that  are 
symmetric  about  their  mean.  NonGaussian  examples  of  spherically- 
invariant  distributions  include  the  t  and  double-exponential  [9]. 


4.  DETECTION  IN  SPHER ICALLY- INVAR  I  ANT  NOISE  (SIN) 

In  this  section,  (N(t))  will  be  SIN  with  representation  (AG(t)). 
(G(t))  is  a  m.s.  continuous  zero-mean  Gaussian  process  and  A  is  a 
strictly-posi tive  discrete  random  variable  independent  of  (G(t))  and 

2 

with  EA  =  1.  ( N ( t )  )  thus  has  zero  mean  and  covariance  the  same  as  the 

covariance  of  (G(t)).  A  takes  on  the  value  a.  with  probability  p^O. 

Likelihood  ratio  detection  of  a  known  signal  in  SIN  has 
previously  been  considered  by  Yao  [18]  for  a  very  special  case:  the 
threshold  on  the  likelihood  ratio  test  statistic  is  unity.  It  has  also 
been  considered  by  Spooner  [17]  for  a  specific  distribution  of  the 
mixing  random  variable  A.  A  more  comprehensive  treatment  has  been 
given  by  Picinbono  and  Vezzosi  [13].  Their  work,  and  that  of  Spooner 
and  Yao,  has  been  for  the  d i sc r e t e - t i me  f i n i t e- samp  1 e- s i ze  problem. 
However,  these  authors  all  use  or  permit  continuous  mixing  r.v.'s  A. 
Our  choice  of  a  discrete  r.v.  for  A  permits  analysis  of  the  continuous 
time  problem  without  introducing  much  mathematical  complication.  It  is 
also  sufficient  to  apply  our  results  to  detection  in  Middleton’s  ('lass 


A  noise. 


The  first  topic  to  be  addressed  here  is  that  of  absolute 
continuity  and  likelihood  ratio.  Sufficient  conditions  are  contained 
in  the  following  result. 


Prop .  4.1.  Suppose  that  (Y(t))  is  a  stochastic  process  adapted  to 

ct(G)  v  ct(V),  where  { V  {  t ) )  is  any  process  independent  of  { N  {  t ) )  . 

Suppose  also  that  Y(t)  =  S(t)  +  N(t)  a.e.  dtdP,  where  (S(t))  is  a 

stochastic  process  adapted  to  a(  Y )  and  with  almost  all  paths  in  Hf 

Moreover,  p_  _  <<  p  „  for  all  a>0,  and 
S  +  aG  aG 


N‘ 


Then  py  <<  pN . 


EdpS+N/dpN](x)  =  2i  :C(a.)(x)  tdpS  +  a.G/dpa.G^X> 


(4-1) 


a.e.  dp^(x).  In  (4-1),  the  sum  is  over  all  a.  such  that  P[A=a  ]  >  0, 
I ^  is  the  indicator  function  for  the  set  C  in  L2[0,T],  and 

1  n  2  2 
C ( a . )  =  {x:  lim  -  2  <x,e.>  /A.  =  a  }. 

1  n  n  j  =  1  J  J 

Moreover,  if  (S(t))  is  any  process  such  that  P[A=a^]  >  0  implies 
^S  +  a  G  Ma  q  .  then  4g  +  ^  <<  4^  and  dps+^/dp^  has  the  representation 

(•1-1  )  . 


Proof  .  If  (Y  )  is  adapted  to  o(G)  v  a  ( V )  ,  then  (Y  )  is  adapted  to 

**  l 

a(aG)  v  a ( V )  for  any  constant  a.  Since  Hw  =  H_  =  H  „ ,  p„ ,  „  < <  1 j  _  for 

IN  O  3,(j  S  +  3.G  aC» 

any  positive  constant  a,  by  Theorem  3  of  [5].  Then  for  any  Borel  set  B 
of  L2[0. 1] , 


ms+n(b)  =  2 


i  Pi4S+a.G(B)  =  2i  PiJ_  ^dpS+a.G/dpa.G^  (x)  dpa.C(x) 

l  B  l  l  l 


1  2  2 

Now,  lim  ^  2  <x,e^>  /A^  =  a^  w.p.l  when  A  =  a^,  under  both  p^  and 


4g  +  pj  •  To  see  this  for  pN  (noise-only  data),  one  notes  that  the  random 


N 


variable  <x,e.>/\2.  has  the  form  a.<G,e.>/A^  when  only  noise  is  present 
J  J  1  J  J 


and  A  =  a^  The  random  variables  (<G,ej>/\j:  j  > 1 }  are  i.i.d.  N(0,1). 


1  2 

Ihus,  by  the  law  of  large  numbers,  —  2  <G,e.>  /A.  ->  1  with  probability 

n  j  J  J  H  y 


n 


one.  When  A=a.,  -  2  <N,e.>2/A. 

in.,  l  i 

.1  =  1 


a.  w.p.l  (p^).  If  signal  is  present 


n 


1  "  2  2 

and  A  =  a  .  ,  then  lim  —  2  <S+a.G,A.>  /A,  is  aga in  equal  to  a.  w.p.l.  To 

1  „  n  ,  it]  i  K 

n-xn  1  u  j 


see  this,  note  that  since  S  is  in  HXI  w.p.  1,  2  <S,e.>  /A.  is  finite. 

N  1  J  J 


1 


n 


1 


n 


This  implies  that  both  —  2  <S,e.>  /A.  and  —  2  <S  ,  e  . > <a  G , e  .  >/X  . 

K  n  ^  J  J  njjijj 

converge  to  zero  w.p.  1. 

The  preceding  shows  that  p  _[C(a.)]  =  1  and  that  p  r[C(a,)]  =  ® 

i  J 

for  i ^  j -  Thus , 


mS*N(B>  -  J  Pi  I,  ,  CdPS+a.G/d,1a.G](x>  dMa.G<x) 


i  BnC(a.) 


=  2  0rX  ,  [dwS»a  .  G/dl,a  .  G]  ( x  >  d|IN(x> 
i  BDC(a . )  l  l 


because  pN(BflC(a.))  =  p.p  _(BnC(a.)).  (4-1)  now  follows  by  the 

IN  1  I  3  .  1 


monotone  convergence  theorem. 


The  expression  for  the  likelihood  ratio  given  in  (4-1)  partitions 
the  Borel  o-field  of  L^CO.T]  into  two  major  subsets.  These  sets  are 


1 


n 


U  C(a.)  and  its  complement,  where  C(a.)  =  (x:  lim  —  2  <x,e.>  /A. 

i  1  1  n  n  J-l  J  J 


=  a.}.  It  is  noteworthy  that  the  likelihood  ratio  does  not  involve  the 


probabilities  P{A=a.}.  These  facts  show  first  that  the  important 


factor  in  determining  the  likelihood  ratio  for  detection  in 
continuous-time  SIN  is  knowledge  of  the  values  which  can  be  assumed  by 
the  mixing  random  variable  A.  However,  no  penalty  is  assessed  if  one 
includes  too  many  possible  values  of  A.  That  is,  if  b  is  not  a 

n 


1  2  2 

possible  value  of  A,  then  the  set  C(b)  =  {x:  lim  —  2  <x,e.>  /X.  =b  } 

n  n  j=l  J  J 


has  zero  p^-p r obab i 1 i ty 


and  so  addition  of  the  term 
I  c  ^  b  )  (  x  )C  ^s  +  bG^^bG  -1 1 x  ^  to  will  not  affect  (with  probability 
one)  performance  of  the  test  statistic. 

A  particular  application  of  the  above  is  the  situation  when  the 
noise  can  be  either  Gaussian  or  spherical ly-invariant  nonGaussian.  If 
P[A=1]  >  0  holds  for  the  mixing  r.v.  in  the  nonGaussian  case,  then  the 
likelihood  ratio  (4-1)  will  still  be  a  likelihood  ratio  if  the  noise 
is  in  fact  Gaussian.  If  P[A=1)  =  0  for  the  nonGaussian  SIN  model,  then 

dp, 


S  +  G 


one  can  add  the  term  I-w,.(x)  , 

C(l)v  '  dp. 


(x)  to  the  likelihood  ratio  (4-1). 


The  resulting  sum  will  be  a  likelihood  ratio  when  either  hypothesis  is 


true. 
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In  the  remainder  of  this  section,  attention  will  be  restricted  to 
the  problem  of  detecting  a  known  signal  S  in  additive  SIN.  In  the  case 
of  Gaussian  noise,  it  is  well  known  that  the  likelihood  ratio  exists 

(non-singular  detection)  if  and  only  if  S  is  in  range(R^).  The  same 
result  holds  if  the  noise  is  any  SIN  process. 


Prop .  4.2.  If  S  is  a  fixed  element  in  ^[O.T],  then  either  p<^  +  ^  -1  P^ 

or  else  pg  +  ^  <<  p^  and  p^  <<  pg  +  N>  Mutual  absolute  continuity  holds  if 

±- 

and  only  if  S  is  in  range(R^). 

Proof  .  If  the  Lo[0,T]  equivalence  class  generated  by  S  is  not  in 


range ( R^  )  ,  then  p 


±  p  for  each  a.,  using  the  known  results  for 

b^3.  .  VJ  3  .  VI  1 

1  1 


the  Gaussian  case.  As  shown  in  the  proof  of  Prop.  4.1  (and  well-known 


[14]),  p  _  1  p  _  for  i^j.  Moreover,  p 

3.  .  Vt  3  .  Lt 


S  +  a .  G  1  ^a.C  for  since 


j-  [22  -1-] 

range  ( Rj^ )  =  range  (a.Rj^  +  a^.Rj^  )  2  and  S 


must  belong  to  this  latter  range 


space  in  order  to  have  mutual  absolute  continuity  of  pe  n  and  p 

i  j 

K 

[14],  Thus  ps+a  1  pa  G  for  all  i  and  j.  so  that  pg+a  1  2  p  p 

i  J  i  j  =  1  j 

K  K 

for  i  =  1 . K.  This  gives  2  p.ps+a  G  1  2  p  p  G>  or  p  1  pN . 

i = 1  “  i  j=l  J  j 

-1_ 

Conversely,  if  S  is  in  range(R^),  pg  +  a  G  ~  Pa  G  for  i=l . K,  so 

i  i 


.  .  P  i  ^S  +  a  .  G  ~  ^j^a.G' 

i  =  l  i  J  =  1  J 


Performance  of  the  likelihood  ratio  (4-1)  can  be  computed  for  the 
case  of  a  known  signal.  When  the  noise  is  Gaussian,  then  it  is 
well-known  that  the  performance  depends  only  on 

d2  =  IISII2  =  2  <S ,  e  >2/A 

N  .  ,  n  n 

n  >  1 

where  {A^,  n>l}  and  (e^,  n-0  are  the  eigenvalues  and  associated 
c.o.n.  eigenvectors  of  the  noise  covariance  operator  R  . 


19 


Prop.  4-3.  Suppose  that  the  signal  is  a  known  function  S  belonging  to 

X. 

range (  )  ,  and  that  P[A=ai]  =  p.  >0.  i>l.  Then  performance  of  a 

likelihood  ratio  test  statistic  is  given  by 


PpA  =  2  p.  P[Z  >  ka.  +  d/(2a.)]  (4-2) 

i 

PD  -  2  p  P[Z  >  ka.  -  d/{2a.)]  (4-3) 

i 

where  Z  is  distributed  N(0,1)  and  k  is  a  constant  whose  value  is 
determined  by  the  desired  value  of  PpA . 


Proof:  For  k  >  0, 


Pp a  =  »Vx:  ^  e  }  =  HN{x:  2  lr(*  ^(X)MX>  >  kd> 


S+N  N 


7  C( a  . ) v  '  i 
i  v  i  ' 


whe  r  e 


M*)  =  l°s[<ddS*a.G/d‘‘a.C><*>3  =  ^  [S  <*  •  V  <S  '  %>/Xn  ‘  d2/2 


Thus 


a .  n 

l 


PFA  =  1  Pi  “a.G^  ei(x>  2  k<*> 

1  1 

=  2  p.p^lx:  a. 2  <x,e  ><S,e  >/X  >  kda2  +  d2/2) . 

i  n  n  n  n  “  i 

i 


Since  the  random  variable  P  defined  by 

P  (x)  =  2  <x,e  XS.e  >/A 
v  '  n  n  n  n 

is  Gaussian  with  respect  to  ,  and  has  mean  zero  and  variance  d 
P pA  =  2.  p.  P[Z  >  ka.  +  d/(2a. ) ] . 

Pn  is  calculated  in  the  same  way. 

n 


As  can  be  seen  from  (4-2)  and  (4-3),  detection  performance 
depends  on  d  and  also  on  the  distribution  of  the  mixing  random 
var iab 1 e  A . 

The  likelihood  ratio  as  given  in  (4-1)  requires  prior  knowledge 
of  the  values  (a^)  that  can  be  assumed  by  the  mixing  random  variable 
A.  However,  this  prior  knowledge  is  not  necessary  in  order  to 
implement  this  detector. 

Prop.  4  - 4  .  For  a  known  signal  S,  a  likelihood  ratio  test  is  to  decide 
signal  present  if  and  only  if 

[  P  (  x  ) /A2 ( x )  -  d2/(2A2(x)]  >  k 


(4-4) 


where  k  is  determined  from  (4-2)  and  (4-3),  and 


~o  1  2 

\  ( x )  =  lira  ^  2  <x.ek>^/Xk. 

n  k=l 


~  2  2 

Proof .  If  x  €  C(a^),  then  A  (x)  =  a^  with  probability  one  under  or 

+  as  shown  in  the  proof  of  Prop.  4-1.  The  result  then  follows 
directly  from  the  expression  (4-1),  or  by  examining  the  proof  of  Prop. 


A  likelihood  ratio  detector  can  thus  be  implemented  without  any 
prior  knowledge  of  the  distribution  of  the  mixing  random  variable  A, 
provided  the  noise  is  in  fact  SIN.  However,  as  can  be  seen  from  the 
expressions  (4-2)  and  (4-3)  for  P^  and  P^ ,  likelihood  ratio  detection 
performance  depends  on  the  complete  distribution  of  A.  This  means  that 
it  is  not  possible  to  set  a  threshold  for  a  specified  P-  unless  one 
has  complete  knowledge  of  the  distribution  of  A. 

This  leads  one  to  consider  the  problem  of  CFAP  (constant  false 
alarm  probability)  detection,  which  has  been  treated  for  many  years  by 
designers  of  active  sonar  detection  systems.  In  this  traditional 
context,  the  detection  problem  is  that  of  detecting  a  signal  in 
Gaussian  noise  which  is  known  except  for  a  scale  factor.  It  is  desired 
to  have  the  same  probability  of  false  alarm  for  any  value  of  the  scale 
factor.  The  scale  factor  has  usually  been  treated  as  an  unknown 
parameter,  rather  than  as  a  random  variable. 

A  CFAP  detector  can  be  obtained  for  the  SIN  detection  problem  by 
using  the  following  decision  procedure: 

decide  signal  present  if  and  only  if 


£(x)/A(x)  >  kd. 


(4-5) 


when  A(x)  =  [A  (x)]2.  When  noise  only  is  present,  P  (x)  / A ( x )  =  P ( x ) /a 
with  probability  one  when  x  =  a.G.  Since  then  P ( x )  =  a.f(G).  one  has 

that  P(x)/ A(x)  is  Gaussian  with  zero  mean  and  variance  d^  and 

PpA  =  P[Z  >  k],  (4-6). 

When  this  detection  algorithm  is  used,  and  x  =  S  +  a.G,  then  P(x)/ a. 

2  2 

is  Gaussian  with  mean  d  /a^  and  variance  d  ,  so  that 


1  p  .  P[Z  >  k  -  d/a . ] . 


(4-7) 


The  difference  in  performance  between  the  optimum  detector  (4-4) 
and  the  CFAP  detector  (4-5)  will  depend  on  the  distribution  of  A. 
Figure  1  shows  an  example  using  a  distribution  for  A  obtained  from 
analyzing  under-ice  sonar  data.  The  curves  show  performance  for  the 
optimum  detector  (4-4),  the  CFAP  detector  (4-5),  and  the  matched 
filter  (A=l  w.p.  1)  when  the  noise  is  SIN  with  the  given  distribution 
for  A.  Also  shown  is  the  performance  that  one  would  obtain  using  the 
matched  filter  if  the  noise  were  truly  Gaussian.  The  difference  in 
performance  of  the  matched  filter  and  the  likelihood  ratio  illustrates 
the  significant  performance  loss  that  can  occur  if  the  noise  is  mis¬ 
takenly  assumed  to  be  Gaussian.  This  and  the  similarity  in  perfor¬ 
mance  of  the  CFAP  detector  and  the  likelihood  ratio  illustrate  the 
wisdom  of  using  a  CFAP  detector  if  there  is  a  possibility  that  the 
noise  is  SIN  with  unknown  distribution. 


22 


For  the  d i sc r e t e - t i me  f i n i t e- samp  1 e  detection  problem,  with  ob¬ 
servation  x  in  En,  the  likelihood  ratio  dPg+^/dP^  is  easily  seen  to  be 

K  K 

dPS  +  N/dPN  <*>  =  Vi^a.G^7.1  PjdPa.G<*>  (4'8) 

- — 1  i  =  l  ~  i-  J  =  1  J- 


where  dPn  is  the  multivariate  density  function  for  the  probability  Pn 

on  En.  In  contrast  to  the  con  t  i  nuou  s- 1  i  me  case  (4-1),  the 
probabilities  p.  =  P [ A  =  a  ^ ]  appear  in  (4-8).  Moreover,  the  likelihood 
ratio  (4-1)  produces  a  non-zero  value  (with  probability  one)  only  if 
the  observation  involves  a.G  for  a.  one  of  the  terms  included  in 

1  l 

(4-1);  otherwise,  the  value  of  the  likelihood  ratio  is  zero.  This  is 
not  true  in  the  discrete-time  case  of  (4-8).  In  the  case  of  a  known 
signal  S,  the  performance  (Pp^  and  P^)  of  (4-1)  depends  only  on  the 

2  _  J-  2 

distribution  of  A  and  on  d  =  IIR^2SII  .  The  d  i  sc  r  e  t  e  - 1  i  me  detector's 

performance  improves  as  the  sample  size  n  increases,  with  d  fixed. 

These  differences  can  all  be  understood  by  examining  the  form  of 

(4-8)  as  the  sample  size  increases. 

Suppose  that  noise  only  is  present,  and  that  the  mixing  r.v.  A 

ikes  on  the  value  a.,  so  that  the  received  waveform  x  is  a.G  with  G 

t  -  i  —  - 

multivariate  Gaussian,  zero  mean,  non-singular  covariance  matrix  R. 
Let  Yn  =  log  p.  dPn  (x)  =  log  p.  -  n  log  a.  -  a^llR  2GII*V(2a'T)  +  C 

-  log  p.  -  n  log  a.  -  a^  2  VfV(2a^)  +  C 
J  J  Jk=1  k  J 


where  C  is  a  constant  and  ( V ^ )  is  i.i.d.  N(0,1).  It  will  be  shown  that 

V n  -  Y n  -♦  - 00  w  .  p  .  1  .  Thus, 

J  i 

yn  _  yn  _  jQ  [p./p.]  -  n  log  [a, /a.]  +  2  V?  j[(  1  -  a^/a^) 

.1  i  .)  i  J  l  J  k=i  k  l  1  J  J 

2 

=  t  -  npu.  .  +  Px  u . 

i  .  J  n  l  ,  j 

where  x n  is  chi-square  with  n  degrees  of  freedom,  P  =  P(i.j) 

-  |  (  1 -a^/a^ )/2 |  .  p  =  p(i.j)  =  |  1 og  (a./a,)|.  a  =  a (  i  .  j )  =  log  p./p.. 

and  u .  =  1  if  a .  <  a  .  ,  u .  .  =  - 1  if  a .  >  a  .  . 

i  •  .1  i  “  J  i  .  J  i  J 

For  a .  <  a  .  ,  m  >  0 ,  x  =  a.G,  one  has  P [ Y n  -  Y n  >  - m ] 

=  J,[P1^Pa  c(x)  >  G(x)  ]  =  P[xa  >  “(m  +T)/p  +  np/p], 

"  .!'  '  i  ~ 

2 

Since  x“"/n  -*  1  w.p.  1  as  n  -*  m ,  and  p/p  >  1  for  a.  <  a  , 

^ 


P[limn  (x^/n  +  (m+'r)/pn  -  p/P)  >  0]  =  0,  for  any  fixed  m  >  0. 

If  a.  >  ajt  then  P[Yj  -  Y®  >  -m]  =  P[xjJ  <  (m+-r)/p  +  np/0],  In 

this  case,  p/p  <  1,  so  P[limn(x^/n  -(m+"r)/pn  -  p/P)  i  0]  =  0. 

Using  these  expressions,  one  can  obtain  the  value  of  n  required 

for  a  specified  approximation,  once  the  distribution  of  A  is  known. 

The  value  of  n  required  for  P[p  .dP^  ^(x)  ^  e  "V^P^  q(£ )]  -  a  w^en 


x  =  a.G  is  determined  from 

—  l — 

P[*2  >  -(m+Tr)/p  +  n p/P]  <  a 


P[xn  <  (m+"r  )/p  +  np/p ]  <  a 


if  a .  <  a  . 
i  J 


if  a .  >  a  .  . 

i  J 


(4-9) 


This  procedure  can  be  repeated  for  the  numerator  of  (4-8).  A 
conservative  result,  which  satisfies 

P[p -dPg+a  £ ( x )  >  e  mp^Pg+a  q ( )  ]  <  a,  is  to  require  n  >  An  +  n^ , 

J  _  aj-  l  _  a.k 


where  n^  is  the  value  for  a  given  by  (4-9)  and  An  satisfies 
Anp( i , j )/P( i , j )  >  dZa/a.  +  d2u ( i , j )/ ( 2a . 2 ) . 


(4-10) 


Applying  these  results  to  the  distribution  used  to  obtain  Figure 

1.  with  e  =  .01  (m  =  4.6)  and  a  =  10  ,  the  required  sample  size  for 

the  denominator  of  (4-8)  is  n  >21,  using  (4-9).  The  value  of  An 
given  by  (4-10)  is  9,  so  that  an  adequate  sample  size  is  30.  This 
rather  small  required  sample  size  is  a  result  of  the  wide  separation 
between  the  three  values  of  A,  and  (to  a  much  lesser  extent)  the 
corresponding  large  differences  in  their  probabilities.  It  can  be 
seen  from  (4-9)  and  (4-10)  that  distributions  for  A  which  have  more 
similar  values  will  require  larger  sample  sizes  in  order  to  achieve 
the  above  bounds,  with  a  requirement  of  n  -*  <*>  as  the  minimum  distance 
between  A  values  converges  to  zero. 

The  gist  of  this  analysis  is  that  the  likelihood  ratio  (4-8) 
converges  to  dPg+a  G^^^a  G  w^en  £  =  ajQ.>  as  sample  size  increases; 

the  rate  of  convergence  depends  on  the  distance  from  a^  to  the  nearest 
value  of  A  not  equal  to  a^;  and  the  probability  ratios  can  be  ignored 
for  large  sample  size.  When  the  samp le  size  n  is  sufficiently  large  to 
assume  equality  in  this  approximation,  then  the  performance  of  the 
d  i  sc  r  e  t  e  -  t  i  me  detector  is  the  same  as  that  of  the  con  t  i  nuou  s- t  i  me 

2 

detector  so  long  as  the  value  of  d  is  fixed. 

For  n  sufficiently  large,  then,  one  can  mimic  the  log- 1  ike  1 ihood 


ratio  for  the  continuous-time  case: 


[dP^+N/dP"](x)  =  £n(x)/A2(x)  -  d2/2A2(x) 


whe  r  e 


A2(x)  =  n"1  2  (x*ek)2/\k. 

k=  1 


*V)  =  2  (x*e  )(S*e  )/X 

k=l  k 


n 


(4-11) 


=  Akek  ,  k=l . n,  and 

k>l)  is  a  complete  orthonormal  set 


i  n 


E 


n 


The  detector  (4-11)  has  previously  been  given  as  an  approximate 
likelihood  ratio  for  large  n  by  Picinbono  and  Vezzosi  [13].  The  above 
analysis  indicates  why  this  is  so,  and  indicates  how  one  can  determine 
how  large  n  must  be  in  order  to  use  the  approximation. 

The  CFAP  detector  now  becomes 

ACFAp(~)  =  ^n(^)/An(x)  (4-12) 

where  An(x)  =  [A2(x)]2. 

For  small  n,  one  may  wish  to  consider  the  CFAP  detector  given  by 


ACFAP(x)  =  fn(x)/an(x) 


(4-13) 


where  ct2(x)  =  (n-1)  *  2 

n  i=l L  A2 


rte*£i>  _  i  ;  (^j}i 


j  =  l  A2 

j 


Since  the  random 


variables  (x«e./-A. :  i  =  l . n}  are  i.i.d.  N(0,a2)  when  A=a,  and  £n(x) 


is  Gaussian,  one  may  wish  to  assume  that  S.e./VX.  =  constant  =  d/Vn 

— 1~  l  i 

for  i=l . n.  The  test  statistic  divided  by  d  then  has  a  t 

distribution  with  n-1  degrees  of  freedom  when  noise  only  is  present; 
this  fact  can  be  used  to  calculate  Pp^  .  Under  this  same  assumption, 
one  can  also  obtain  an  expression  for  P^  for  this  detector  if  the 

o 

distribution  of  A  is  known,  using  the  fact  that  (n  -  1)C7n  is 
chi-square  distributed  with  n-1  degrees  of  freedom.  One  could  use 
these  considerations  to  determine  a  worst-case  value  of  P^  if  the 
distribution  of  A  is  known  to  belong  to  a  specified  family,  while 
maintaining  a  desired  Pp^. 

The  problem  of  detecting  a  signal  in  Gaussian  noise  having 
unknown  scale  factor  is  familiar  in  active  sonar.  A  detailed  treatment 
of  CFAP  detection  for  this  problem  has  been  given  by  Grieve  [7],  who 
obtained  CFAP  optimality  properties  for  (4-12). 
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5.  APPLICATIONS  TO  DETECTION  IN  IMPULSIVE  NOISE  ENVIRONMENTS 

As  previously  noted,  Middleton's  Class  A  univariate  model  is  a 
special  case  of  SIN.  Detection  in  such  noise  has  been  analysed  by 
several  approaches.  In  [16],  Spaulding  and  Middleton  assume 
independent  sampling  and  then  develop  bounds  on  likelihood  ratio 
performance  for  communicating  with  a  known  signal  over  a  channel  in 
the  presence  of  Class  A  noise.  The  Middleton  model  has  also  often  been 
approximated  by  using  only  the  first  two  or  three  terms:  "Gauss-Gauss" 
or  "Gauss-Gauss-Gauss"  noise. 

If  the  mixing  random  variable  of  the  Middleton  model  remains 
constant  over  the  observation  interval,  then  the  results  given  above 
can  be  used  to  provide  detection  results.  It  is  not  necessary  to  have 
independent  sampling,  but  only  to  know  the  covariance  matrix  of  the 
noise  and  the  parameters  U  and  T.  For  detection  of  a  known  signal,  the 
preceding  results  can  be  used  in  several  ways.  They  provide  upper 
bounds  on  detection  performance  by  giving  the  con t i nuous- t i me 
detection  performance.  Secondly,  they  provide  a  method  for  obtaining 
exact  detection  performance  for  the  d i scr e t e- t i me  finite-sample-size 
detectors,  and  provide  a  means  of  calculating  required  sample  size  in 
order  to  simplify  the  detector  structure.  Thirdly,  they  can  be  used  to 
obtain  d i sc r e te- t ime  CFAP  detectors,  as  well  as  upper  bounds  on  the 
performance  of  such  detectors.  Finally,  the  fact  that  the  likelihood 
ratio  detector  can  be  implemented  without  knowing  the  distribution  of 
the  mixing  r.v.  A,  once  the  sample  size  n  is  reasonably  large  (4-11), 
can  provide  a  significant  reduction  of  the  complexity  of  the 
implementation.  One  need  only  adjust  the  threshold  as  a  function  of 
the  parameters  U  and  T,  while  the  operation  on  the  data  is  unchanged. 
Even  this  adjustment  is  not  necessary  if  one  is  willing  to  use  a  CFAP 
detector . 

The  imbedding  of  the  Middleton  Class  A  model  within  the  general 
SIN  model  thus  provides  a  number  of  useful  results.  One  may  note  the 
importance  of  the  con t i nuous- t i me  model,  which  is  often  disregarded  on 
the  grounds  that  it  is  not  relevant  to  practical  signal  detection.  In 
the  present  case.  the  con t i nuou s- t i me  model  provides  useful  upper 
bounds  on  detection  performance  for  both  the  likelihood  ratio  detector 
and  the  CFAP  detectors.  It  also  provides  one  with  a  prac t i ca 1 1 y-u se f u 1 
implementation  and  simplification  of  the  apparently  extremely-compli¬ 
cated  d i sc  re te- 1 i me  likelihood  ratio  detectors,  and  a  rationale  for 
making  this  simplification.  The  notion  of  orthogonal  measures  is 
central  to  these  results. 


6.  EXTENSIONS  TO  THE  SIN  MODEL 


The  SIN  model  is  not  realistic  for  many  situations,  such  as 
observation  periods  where  the  mixing  r.v.  A  cannot  be  expected  to  take 
on  a  constant  value.  A  more  reasonable  model  in  such  situations  would 
be  generalized  sphe r i ca 1 1 y- i nvar ian t  noise,  of  the  form 
N(t)  =  A(t)G(t),  where  now  (A(t))  is  a  stochastic  process  independent 
of  the  Gaussian  process  (G(t)).  This  reduces  to  a  SIN  model  in  the 
univariate  case.  Some  work  has  previously  been  done  for  such  a  model 
[15],  but  general  results  are  so  far  not  available. 
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